Patent 

File 265.0026 0101 



DRUG DISCOVERY METHODS 



10 



15 



CONTINUING APPLICATION DATA 
This application claims the benefit of U.S. Provisional Application Serial 
No. 60/240,187, filed October 13, 2000, which is incorporated by reference herein. 

GOVERNMENT FUNDING 
The present invention was made with government support under Grant No. 
DAAD 19-01 -1-0361, awarded by Defense Advanced Research Projects Agency. 
The Government has certain rights in this invention. 



BACKGROUND 

Pathogenic viruses, bacteria and bacterial toxins, fungi, and parasites are the 
cause of serious health problems for both humans and animals, and potential agents 
for biological and agricultural warfare and terrorism (BWT). Effective vaccines and 

20 therapeutic drugs are not available for the vast majority of these pathogens. At 

present, the medical and public health response to a pathogenic infections relies on 
minimizing transmission and providing palliative care, and treatment when 
appropriate with anti-bacterial or anti-fungal agents. Developing fast-acting 
therapeutic compounds to inhibit the replication of many dissimilar pathogens is 

25 essential to improving human and animal health, and the ability to respond to a 
BWT incident. 

Previous approaches to discovering and developing antimicrobial 
countermeasure have relied on (1) screening chemical compounds with little or no a 
priori knowledge of their potential action or targets, and (2) screening combinatorial 
30 libraries of random amino acid or nucleotide sequences for compounds that bind 
tightly to a favored target biomolecule. These approaches are extremely costly and 
time-consuming, but rarely successful. Moreover, these approaches are generally 
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specific to only one pathogen or a closely related group of pathogens. 

Several complementary studies have recently shown that small in vivo 
combinatorial libraries can be used to select peptides that produce a phenotypic 
response. These responses included inhibition of the yeast spindle checkpoint and 
increased mating pheromone response (Norman et al., Science, 285, 591-595 
(1999)), and binding to the REV responsive element of the human 
immunodeficiency virus RNA (Tan and Frankel, Proc. Natl Acad. Set USA., 95, 
4247-4252(1998)). 

SUMMARY OF THE INVENTION 

The present invention represents an advance in the art of discovering drugs 
that can be used to prevent morbidity and mortality associated with pathogens and 
toxins, including, for instance, highly lethal agents that could be used in biological 
warfare. Despite extensive effort, effective countermeasures against agents that can 
be used in biological warfare threats do not yet exist. Current approaches to 
developing antiviral compounds include screening natural products or chemicals in 
vivo, combinatorial organic synthesis, high throughput in vitro screening, and X-ray 
analysis of ligand-protein co-crystals. These conventional approaches are extremely 
costly and time-consuming, but rarely successful. The present invention provides a 
technology which rapidly generates medical countermeasures and identifies 
validated molecular targets essential for pathogen replication. These 
countermeasures are designed for use before or after exposure to pathogens and 
toxins to prevent disease. 

The present invention is directed to a collection of polypeptides that includes 
at least two polypeptides. Each polypeptide includes a fragment of SEQ ID NO:l 
beginning at any amino acid from about 1 19 to about 124 and ending at any amino 
acid from about 258 to about 275. At least two consecutive amino acids within the 
regions of amino acids 129-137, or amino acids 182-189, or amino acids 257-264 as 
depicted at SEQ ID NO:l are replaced by an amino acid sequence that includes 
Xaa n , wherein n is from about 5 to about 21, and each Xaa is independently a 



random amino acid. Two examples of polypeptides that are members of one 
collection are SEQ ID NO:33 and SEQ ID NO:34. Each member of the collection 
may further include a cell-permeant region fused to the amino terminal end of the 
polypeptide. Preferably, the cell-permeant region includes an amino acid sequence 
5 YGRKKRRQRRR (SEQ ID NO:2), RQIKIWFQNRRMKWKK (SEQ ID NO:3), 
RQIKIWFPNRRMKWKK (SEQ ID NO:4), or RQPKIWFPNRRPKWKK (SEQ ID 
NO:5). The invention is further directed to a cell that includes a member of the 
collection of polypeptides, and a population of cells that includes two or more cells, 
wherein each member of the population includes one polypeptide of the collection 

10 of polypeptides. 

The invention also provides a polypeptide selected from the group consisting 
of an amino acid sequence SEQ ID NO:2 fused to an amino terminal end of a 
fragment of SEQ ID NO:l beginning at any amino acid from about 1 19 to about 124 
and ending at any amino acid from about 258 to about 275, wherein at least two 

1 5 consecutive amino acids within the regions of amino acids 1 29- 1 37, or amino acids 
182-189, or amino acids 257-264 as depicted at SEQ ID NO: 1 are replaced by an 
amino acid sequence including Xaa„, wherein n is from about 5 to about 21, and 
each Xaa is independently a random amino acid. The polypeptide may further 
include a cell-permeant region fused to the amino terminal end of the polypeptide. 

20 The invention is further directed to a cell that includes a member of the collection of 
polypeptides. 

Further provided by the invention is a collection of polynucleotides 
including at least two polynucleotides. Each polynucleotide includes a coding 
sequence encoding a polypeptide that includes a fragment of SEQ ID NO: 1 

25 beginning at any amino acid from about 1 1 9 to about 124 and ending at any amino 
acid from about 262 to about 275, wherein at least two consecutive amino acids 
within the regions of amino acids 129-137, or amino acids 182-189, or amino acids 
257-264 as depicted at SEQ ID NO:l are replaced by an amino acid sequence 
including Xaa„, wherein n is from about 5 to about 21, and each Xaa is 

30 independently a random amino acid. The polypeptide may further include a cell- 
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permeant region fused to the amino terminal end of the polypeptide. The nucleotide 
sequence of the coding sequence encoding the Xaa^ may consists of a nucleotide 
sequence NNK m , wherein N is independently a random nucleotide, K is 
independently a guanine or a thymine, and wherein o is from about 5 to about 21. 
5 The polynucleotide may be present in a vector, for instance, a retrovirus vector. 

The invention is further directed to a cell that includes a member of the collection of 
polynucleotides, and a population of cells that includes two or more cells, wherein 
each member of the population includes one polynucleotide of the collection of 
polynucleotides. 

10 The present invention provides a method for crystallizing a polypeptide that 

includes an amino acid sequence SEQ ID NO:l. The method includes preparing 
purified polypeptide that includes an amino acid sequence SEQ ID NO:l at a 
concentration of about 3 mg/ml to about 20 mg/ml, and crystallizing the polypeptide 
from a solution containing about 20 % by weight to about 28 % by weight 

15 polyethylene glycol, about 0.05 M to about 0.2 M ammonium sulfate, and about 1 
mM to about 20 mM urea, wherein the solution is buffered to a pH of about 6 to 
about 8. Another method for crystallizing a polypeptide that includes an amino acid 
sequence SEQ ED NO:l includes preparing purified polypeptide that includes an 
amino acid sequence SEQ ID NO: 1 at a concentration of about 3 mg/ml to about 20 

20 mg/ml, and crystallizing the polypeptide from a solution that contains about 15 % 
by weight to about 25 % by weight polyethylene glycol 4000, and about 0.05 M to 
about 0.4 M MgC12, wherein the solution is buffered to a pH of about 6 to about 8. 

The invention is further directed to a crystal of a polypeptide that includes 
an amino acid sequence SEQ ID NO:l. Preferably, the crystal has the space group 

25 symmetry P2 1 2j2 I . Preferably, the crystal includes a unit cell having dimensions of 
a, b, and c; wherein a is about 69.3 A to about 72.0 A, b is about 75.2 A to about 
76.0 A, and c is about 90. 1 A to about 94.7 A; and wherein a = p = y = about 90°. 

Also provided by the invention is a method for identifying a polypeptide 
within a collection that prevents cell death after exposure to a pathogen or a toxin. 

30 The method includes providing a cell that contains a polypeptide that is a member 
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of a collection of polypeptides including at least two polypeptides. Each 
polypeptide includes a fragment of SEQ ID NO: 1 beginning at any amino acid from 
about 1 19 to about 124 and ending at any amino acid from about 258 to about 275, 
wherein at least two consecutive amino acids within the regions of amino acids 129- 
5 137, or amino acids 182-189, or amino acids 257-264 as depicted at SEQ ID NO:l 
are replaced by an amino acid sequence including Xaa^ wherein n is from about 5 
to about 21, and each Xaa is independently a random amino acid.. The cell is 
exposed to a pathogen or a toxin, and whether the polypeptide prevents cell death is 
determined by incubating the cell under conditions such that the pathogen or the 

10 toxin kills a cell that does not include a polypeptide that prevents cell death after 
exposure to a pathogen or a toxin. The presence of a cell that proliferates indicates 
the polypeptide prevents cell death after exposure to a pathogen or a toxin. The 
pathogen may be, for instance, a virus or a microbe. Examples of microbes include 
a bacterium, a rickettsia, and a fungus. Examples of toxins include a biological 

1 5 toxin or a chemical toxin. 

The invention provides a method for identifying a polypeptide within a 
collection that binds a pathogen, a toxin, a polypeptide, or a polynucleotide. The 
method includes providing a cell that includes a polypeptide that is a member of a 
collection of polypeptides including at least two polypeptides. Each polypeptide 

20 includes a fragment of SEQ ID NO: 1 beginning at any amino acid from about 1 19 
to about 124 and ending at any amino acid from about 258 to about 275, wherein at 
least two consecutive amino acids within the regions of amino acids 129-137, or 
amino acids 182-189, or amino acids 257-264 as depicted at SEQ ID NO:l are 
replaced by an amino acid sequence including Xaa n , wherein n is from about 5 to 

25 about 21, and each Xaa is independently a random amino acid. The cell is exposed 
to a pathogen or a toxin, and whether the polypeptide prevents cell death is 
determined by incubating the cell under conditions such that the pathogen or the 
toxin kills a cell that does not include a polypeptide that prevents cell death after 
exposure to a pathogen or a toxin. The presence of a cell that proliferates indicates 

30 the polypeptide binds the pathogen, the toxin, a polypeptide, or a polynucleotide. 
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The pathogen may be, for instance, a virus or a microbe. Examples of microbes 
include a bacterium, a rickettsia, and a fungus. Examples of toxins include a 
biological toxin or a chemical toxin. 

5 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Amino acid sequence (SEQ ID NO:l) of amino acids 1 19-275 of the 
Venezuelan equine encephalitis (VEE) virus capsid polypeptide carboxy terminal 
portion and a nucleotide sequence (SEQ ED NO: 12) that encodes SEQ ID NO: 1 . 

10 Figure 2. 2 A, Specific examples of members (Adaptein-1 and Adaptein-2) of a 
collection of polypeptides of the present invention. 2B, an alignment of the 
Adaptein nucleotide sequences with the CCD nucleotide sequence. A-l, Adaptein- 
1; A-2, Adaptein-2; CCD, amino acids 1 19-275 of VEE virus capsid polypeptide 
carboxy terminal portion; HindDI and Xhol, restriction endonuclease sites; dashes 

15 indicate an absence of a nucleotide. Figure 2C, an alignment of the Adaptein amino 
acid sequences with the CCD amino acid sequence. Dashes indicate an absence of 
an amino acid. 

Figure 3. Nucleotide sequence (SEQ ID NO:6) encoding the tat-CCD fusion 
20 polypeptide and predicted amino acid sequence (SEQ ID NO:7). 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS OF THE 

INVENTION 

25 Compounds 

The present invention provides collections of polypeptides. A collection of 
polypeptides is also referred to herein as a library, and as an adaptein library. As 
used herein, "polypeptide" refers to a polymer of amino acids linked by peptide 
bonds and does not refer to a specific length of a polymer of amino acids. Thus, for 
30 example, the terms peptide, oligopeptide, protein, and enzyme are included within 
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the definition of polypeptide. This term also includes post-expression modifications 
of the polypeptide, for example, glycosylations, acetylations, phosphorylations and 
the like. In some aspects of the invention, a polypeptide is isolated. As used herein, 
an "isolated" polypeptide or polynucleotide means a polypeptide or polynucleotide 
5 that has been either removed from its natural environment, produced using 
recombinant techniques, or chemically or enzymatically synthesized. In some 
aspects of the invention, the polypeptide is preferably purified. As used herein, a 
"purified" polypeptide or polynucleotide means a polypeptide or polynucleotide that 
is essentially free from any other polypeptide or polynucleotide and associated 
10 cellular products or other impurities. As used herein, a "collection" of polypeptides 
or polynucleotides is a population of at least two polypeptides or polynucleotides, 
where the population includes regions of amino acids or nucleotides that are 
identical in each member of the population, and a region of amino acids or 
nucleotides that are not identical in each member of the population. Unless 
15 otherwise specified, "a," "an," "the," and "at least one" are used interchangeably and 
mean one or more than one. 

Each polypeptide of a collection includes a fragment of an amino acid 
sequence having a peptide backbone conformation that acts to display a variable 
amino acid sequence on the surface of the polypeptide. Variable amino acid 
20 sequences are described in greater detail herein. A fragment of an amino acid 

sequence having this peptide backbone conformation is also referred to herein as a 
carrier polypeptide or scaffold polypeptide. A preferred example of such an amino 
acid sequence is a carboxy terminal portion of the Venezuelan equine encephalitis 
(VEE) virus capsid polypeptide. An example of the amino acid sequence of the 
25 VEE capsid polypeptide is available at GenBank Accession Number L01443. A 
preferred carboxy terminal portion of the VEE capsid polypeptide, also referred to 
herein as "CCD," begins at about amino acid 1 19 and ends at about amino acid 275, 
and is depicted at SEQ ID NO:l (see Figure 1). Amino acids 1 19-275 of the CCD 
are encoded by nucleotides 7916-8386 of GenBank Accession Number L01443, and 
30 are shown in Figure 1. This amino acid sequence forms a trypsin resistant and 
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chymotrypsin-resistant structure of predominantly p-sheets, with small loops 
connecting sequential strands. 

Accordingly, in one aspect of the invention, a collection of polypeptides 
includes at least two polypeptides, where each polypeptide includes a fragment of 
5 SEQ ID NO: 1 . The fragment begins at any amino acid from about 1 19 to about 
124, preferably, about 1 19. The fragment ends at any amino acid from about 258 to 
about 275, preferably, about 275. The fragment further includes a variable amino 
acid sequence, which is described in detail herein. The variable amino acid 
sequence replaces from about 1 to about 4, more preferably, from about 2 to about 

10 3, most preferably, about 2, amino acids within 1 of 3 regions of the fragment. The 
3 regions are amino acids 129-137, amino acids 182-189, and amino acids 257-264 
of SEQ ID NO:l. Preferably, a variable amino acid sequence replaces amino acids 
within the third region, i.e., amino acids 257-264. Preferably, the amino acids 
within the third region that are replaced by the variable amino acid sequence are 

1 5 amino acids 260-26 1 of SEQ ID NO: 1 . 

Another example of a fragment of an amino acid sequence having a peptide 
backbone conformation that acts to display a variable amino acid sequence on the 
surface of the polypeptide is the inactive staphylococcal nuclease (SNase) 
polypeptide 

20 (KETAAAKFERQHMDSSTSAASSSNYCNQMMKSRNLTKDRCKPVNTFVHES 
LADVQAVCSQKNVACKNGQTNCYQSYSTMSITDCRETGSSKYPNCAYKTT 
QANKHUVACEGNPYVPVHFAASV, SEQ ID NO:8, depicted at GenBank 
Accession Number 3402176) . The variable amino acid sequence replaces from 
about 1 to about 4 amino acids from the region of amino acids 19-27 of SEQ ID 

25 NO:8. 

The variable amino acid sequence that is used to replace amino acids of a 
fragment of SEQ ID NO: 1 has the amino acid sequence Xaa n . The "n" can be, in 
increasing order of preference, from about 5 to about 21, from about 6 to about 18, 
from about 6 to about 12, most preferably, about 6. Each Xaa is independently any 
30 amino acid, preferably one of the 20 natural amino acids. Thus, in a single 
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collection of polypeptides, each member of the collection has a variable amino acid 
sequence that has the same number of amino acids (i.e., about 5 to about 21) but a 
different amino acid sequence. Accordingly, in the aspect of the present invention 
where amino acids 260-261 are replaced by Xaa n and n is 6, a polypeptide of a 
5 collection of polypeptides has the following amino acid sequence: amino acids 1 19- 
259 as depicted at SEQ ID NO: 1 followed by the amino acid sequence 
XaaXaaXaaXaaXaaXaa (SEQ ID NO: 11) followed by amino acids 262-275 as 
depicted at SEQ ID NO: 1 . 

The present invention is also directed to individual members of the 
10 collections of polypeptides. The number of members of a collection of polypeptides 
is large and varies as a function of the value of "n" in the variable amino acid 
sequence Xaa„. The number of members of a collection of polypeptides is also 
finite, and the amino acid sequence of each member can be readily determined by 
one skilled in the art by methodically changing one amino acid at a time in the 
15 variable amino acid sequence. 

Typically, the variable amino acid sequence is flanked on either side by two 
amino acids. These two amino acids on each side are present as a result of 
introducing 2 restriction endonuclease sites into the nucleotide sequence encoding a 
polypeptide of the present invention. For instance, when the restriction 
20 endonuclease is Hindm, the 2 amino acids will be KL, and when the restriction 
endonuclease is Xhol, the 2 amino acids will be LE. 

Optionally and preferably, the variable amino acid sequence is flanked by a 
linker amino acid sequence. A linker can be used to provide some conformational 
freedom to the variable amino acid sequence, and/or to allow the variable amino 
25 acid sequence to be spatially separated from the fragment amino acid sequence. 
Preferably, a linker includes from about 1 to about 4 amino acids, more preferably, 
about 2 to about 3 amino acids, most preferably, 3 amino acids. Examples of 
linkers include Ser-Ser-Gly, Ser-Gly-Ser, Gly-Ser-Gly, and Ser-Gly. 

Optionally and preferably, each polypeptide of a collection of polypeptides 
30 also includes a cell-permeant region. As used herein, a "cell-permeant region" is a 
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polypeptide that causes polypeptides to which it is fused to traverse cell membranes, 
including cultured cells, and cells present in animals including cells of the blood- 
brain barrier (see, for instance, Nagahara et al., Nat. Medicine, 4, 1449-1452 (1998), 
Schwarze et al., Science 285, 1569-1572 (1999), and Vocero-Akbani et al., Meth. 
5 Enzym., 322, 508-521 (2000)). As used herein, two polypeptides are "fused" when 
they are covalently bound by a peptide bond. Preferably, the cell-permeant region 
is present at the amino terminal end of the polypeptide, fused to the amino terminal 
amino acid of a fragment amino acid sequence. Preferred examples of cell- 
permeant regions include YGRKKRRQRRR (SEQ ID NO:2), 

10 RQIKJWFQNRRMKWKK (SEQ ID NO:3), RQIKTWFPNRRMKWKK (SEQ ID 
NO:4), and RQPKIWFPNRRPKWKK (SEQ ID NO:5), preferably, the cell- 
permeant region is SEQ ID NO:2. 

Specific examples of members of one collection of polypeptides include the 
amino acid sequences depicted in Figure 2. The two examples, labeled Adaptein-1 

15 and Adaptein-2, each contain a variable amino acid sequence of 6 amino acids. In 
Adaptein-1, the sequence of the variable amino acid sequence is SPHYAQ (amino 
acids 262-267 of SEQ ID NO:33), and in Adaptein-2, the sequence of the variable 
amino acid sequence is RSGTQW (amino acids 262-267 of SEQ ID NO:34). The 
amino acids KL and LE which flank the variable amino acid sequences are encoded 
20 by the nucleotides encoding the restriction endonuclease sites Hindm and Xhol, 
respectively. 

The present invention also includes a population of cultured cells including 
two or more cells, where each cell of the population includes one polypeptide of one 
collection of polypeptides. The present invention also provides individual cultured 

25 cells containing a polypeptide that is a member of a collection of polypeptides. The 
cells may be prokaryotic or eukaryotic, preferably, eukaryotic, more preferably, 
vertebrate, most preferably, mammalian. Examples of useful mammalian cultured 
cells include 293 cells, macrophage cells, including J774 and RAW 264.7, HeLa 
cells, and Vero cells, each of which is available from the ATCC 

30 The present invention also provides collections of polynucleotides. As used 
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herein, "polynucleotide" refers to a polymeric form of nucleotides of any length, 
either ribonucleotides or deoxynucleotides, and includes both double- and single- 
stranded DNA and RNA. A polynucleotide may include nucleotide sequences 
having different functions, including, for instance, coding sequences, and 
5 non-coding sequences such as regulatory sequences. A polynucleotide can be 
obtained directly from a natural source, or can be prepared with the aid of 
recombinant, enzymatic, or chemical techniques. A polynucleotide can be linear or 
circular in topology, and can be, for example, a portion of a vector, such as an 
expression or cloning vector, or a fragment. 

10 A collection of polynucleotides encodes a collection of polypeptides of the 

present invention. Thus, each member of a collection of polynucleotides includes a 
coding sequence that encodes a fragment of an amino acid sequence, preferably 
SEQ ID NO: 1. A "coding sequence" is a nucleotide sequence that encodes a 
polypeptide and, when placed under the control of appropriate regulatory sequences, 

1 5 expresses the encoded polypeptide. The boundaries of a coding region are generally 
determined by a translation start codon at its 5 'end and a translation stop codon at 
its 3' end. 

The coding sequence further includes a nucleotide sequence encoding a 
variable amino acid sequence. Each codon of the coding sequence encoding the 

20 variable amino acid sequence may have the nucleotide sequence (NNN) m . The "m" 
can be, in increasing order of preference, from about 5 to about 21, from about 6 to 
about 18, from about 6 to about 12, most preferably, about 6. Each "N" is 
independently either adenine (A), thymine (T), guanine (G), or cytosine (C). 
Preferably, each codon of the coding sequence encoding the variable amino acid 

25 sequence has the nucleotide sequence (NNZ) m . "Z" is independently either R, Y, M, 
K, or S, where R is G or A, Y is T or C, M is A or C, K is G orT, and S is G or C. 
Most preferably, each codon of the coding sequence encoding the variable amino 
acid sequence has the nucleotide sequence (NNK) m . 

The present invention is also directed to individual members of the 

30 collections of polynucleotides. The number of members of a collection of 
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polynucleotides is large and varies partially as a function of the value of "m" in 
(NNN) m . The number of members of a collection of polynucleotides is also finite, 
and the amino acid sequence of each member can be readily determined by one 
skilled in the art. Specific examples of members of such a collection include the 
nucleotide sequences depicted in Figure 2. 

Optionally and preferably, a coding sequence of a polynucleotide of the 
present invention further includes a nucleotide sequence encoding a linker as 
described herein. Optionally and preferably, a coding sequence of a polynucleotide 
of the present invention also includes a nucleotide sequence encoding a cell- 
permeant region as described herein. 

The present invention also includes a population of cultured cells including 
two or more cells, where each cell of the population includes one polynucleotide of 
one collection of polynucleotides. The present invention also provides individual 
cultured cells containing a polynucleotide that is a member of a collection of 
polynucleotides. Preferably, when a polynucleotide of the present invention is 
present in a eukaryotic cell, the polynucleotide is inserted into the genomic DNA of 
the cell. 

A polynucleotide of the invention can be inserted in a vector. A vector can 
be used in the construction of the collection of polynucleotides described herein, as 
a way to insert polynucleotides of the present invention into the genomic DNA of a 
eukaryotic cell, and/or as a way to cause a polynucleotide of the present invention to 
express a polypeptide of the present invention. 

A vector is a replicating polynucleotide, such as a plasmid, to which another 
polynucleotide may be attached so as to bring about the replication of the attached 
polynucleotide. Construction of vectors containing a polynucleotide of the invention 
employs standard ligation techniques known in the art. See, e.g., Sambrook et al, 
Molecular Cloning: A Laboratory Manual. , Cold Spring Harbor Laboratory Press 
(1989) or Ausubel, R.M., ed. Current Protocols in Molecular Biology (1994). A 
vector can provide for further cloning (amplification of the polynucleotide), i.e., a 
cloning vector, or for expression of the polypeptide encoded by the coding region, 
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i.e., an expression vector. A vector can provide for the insertion of a polynucleotide 
into the genomic DNA of a eukaryotic cell, i.e., a suicide vector. A vector can 
provide for more than one of these functions, for instance, a vector can provide for 
expression and be a suicide vector. The term vector includes, but is not limited to, 
5 plasmid vectors, viral vectors (including retroviral vectors), cosmid vectors, or 
artificial chromosome vectors. Typically, a vector is capable of replication in a 
bacterial host, for instance E. coli. 

Selection of a vector depends upon a variety of desired characteristics in the 
resulting construct, such as a selection marker, vector replication rate, and the like. 
1 0 Suitable host cells for cloning or expressing the vectors herein are prokaryote or 
eukaryotic cells. Preferably the host cell secretes minimal amounts of proteolytic 
enzymes. Suitable prokaryotes include eubacteria, such as gram-negative or 
gram-positive organisms, for example, E. coli, Bacilli such as B. subtilis, 
Pseudomonas species such as P. aeruginosa, or Salmonella typhimurium. 
1 5 Preferably, E. coli is used. 

An expression vector optionally includes regulatory sequences operably 
linked to the coding region. A "regulatory sequence" is a nucleotide sequence that 
regulates expression of a coding sequence to which it is operably linked. 
Nonlimiting examples of regulatory sequences include promoters, enhancers, 
20 transcription initiation sites, translation start sites, translation stop sites, 

transcription terminators, and poly(A) signals. The term "operably linked" refers to 
a juxtaposition of components such that they are in a relationship permitting them to 
function in their intended manner. A regulatory sequence is "operably linked" to a 
coding region when it is joined in such a way that expression of the coding region 
25 is achieved under conditions compatible with the regulatory sequence. 

The invention is not limited by the use of any particular promoter, and a 
wide variety are known. Promoters act as regulatory signals that bind RNA 
polymerase in a cell to initiate transcription of a downstream (3' direction) coding 
region. The promoter used in the invention can be a constitutive or an inducible 
30 promoter. It can be, but need not be, heterologous with respect to the host cell. 
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Promoter sequences are known for eukaryotes and can be suitably inserted into an 
expression vector. Transcription of a coding sequence encoding a polypeptide of 
the present invention in a host cell can be controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus, 
adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, 
cytomegalovirus, a retrovirus, and Hepatitis-B virus. Preferably, the promoter is the 
human cytomegalovirus (CMV) immediate early promoter. 

Transcription of a coding sequence encoding a polypeptide of the present 
invention by a eukaryote cell can be increased by inserting an enhancer sequence 
into the vector. Enhancers are cis-acting elements of DNA, usually having about 10 
to 300 nucleotides, that act on a promoter to increase its transcription. Enhancers are 
relatively orientation- and position-independent, having been found 5' and 3' to 
coding regions, within an intron as well as within the coding sequence itself. Many 
enhancer sequences are now known from mammalian genes (globin, elastase, 
albumin, alpha-fetoprotein, and insulin). Enhancers from eukaryotic cell viruses are 
also known and include the SV40 enhancer on the late side of the replication origin, 
the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side 
of the replication origin, adenovirus enhancers, and the enhancer present in the 
Moloney murine sarcoma virus 5' long terminal repeat. The enhancer may be 
spliced into the vector at a position 5 'or 3' to the coding region encoding a 
polypeptide of the present invention, but is preferably located at a site 5' of the 
promoter. 

The polynucleotide used to transform the host cell optionally includes one or 
more marker sequences, which typically encode a molecule that inactivates or 
otherwise detects or is detected by a compound in the growth medium. For 
example, the inclusion of a marker sequence can render the transformed cell 
resistant to an antibiotic, or it can confer compound-specific metabolism on the 
transformed cell. Examples of a marker sequence are sequences that confer 
resistance to kanamycin, ampicillin, chloramphenicol, tetracycline, puromycin, 
neomycin, and formulations of phleomycin Dl including, for example, the 
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formulation available under the trade-name ZEOCIN (Invitrogen). 

The present invention also includes crystals of a polypeptide having the 
amino acid sequence depicted at SEQ ID NO:l, which are suitable for x-ray 
crystallographic analysis. Preferably, the crystals have a P2 1 2,2 1 space group. 
5 Most preferably, the crystal includes rectangular shaped unit cells, each unit cell 
having the dimensions a= about 69.3 to about 72.0 A, preferably about 70.0 A, b= 
about 75.2 to about 76.0 A, preferably about 75.0 A, and c= about 90.1 to about 
95.0 A, preferably about 94.7 A, a = about 90, p = about 90, and y = about 90. The 
crystallized enzyme is a monomer and has three molecules in the asymmetric unit. 
1 0 Variation in buffer and buffer pH as well as other additives such as polyethylene 
glycol (PEG) is apparent to those skilled in the art and may result in similar crystals. 

The present invention is also directed to methods for crystalizing a 
polypeptide. The polypeptide may have the amino acid sequence depicted at SEQ 
ID NO: 1, or be a polypeptide of the present invention. 
1 5 A method for crystalizing a polypeptide includes growing crystals in 

hanging drops at about 18°C using about 3 mg/ml to about 20 mg/ml of 
polypeptide, preferably, about 10 mg/ml mixed with about an equal volume of 
precipitant. The volume of polypeptide used is about 1 pi to about 15 ul, 
preferably, about 5 |il. Two different crystallization conditions can be used to 
20 produce crystals. In one condition, about 20 % to about 28 %, preferably 24% 
(w/v) polyethylene glycol (PEG), about 0.05 M to about 0.2 M, preferably, about 
0. 1 M ammonium sulfate, about 1 raM to about 20 mM, preferably, about 5 mM 
urea, and about 0.02 M to about 0.3 M, preferably, about 0.1 M Bis-Tris buffer, pH 
about 6 to about 8, preferably, about pH 6.5 can be used. Preferably, the PEG is 
25 PEG 1000. In the second condition, about 15 % to about 25 %, preferably, about 20 
% (w/v) PEG 4000, about 0.05 M to about 0.4M, preferably, about 0.2 M MgCl 2 
buffered in Bis-Tris, pH about 6 to about 8, preferably, about pH 6.5 can be used. 
The crystals in each condition can be allowed to grow for about 8 to about 12 days, 
preferably, about 10 days. Crystals can be transferred to precipitant solution 
30 containing about 5 % to about 25 %, preferably, about 10 % 2,4-methylpentanediol 
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(MPD) as cryo-protectant. 
Methods of use 

The present invention is further directed to methods for identifying a 
polypeptide within a collection. In one aspect, the method identifies a polypeptide 
within a collection, where the polypeptide prevents cell death after the cell is 
exposed to a pathogen or a toxin. In another aspect, the method identifies a 
polypeptide within a cell, where the polypeptide binds a pathogen, a toxin, a 
polypeptide, of a polynucleotide. The methods include providing a cell that 
contains a polypeptide of the present invention, exposing the cell to a pathogen or a 
toxin, and determining if the polypeptide of the present invention prevents cell 
death. To determine if the polypeptide prevents cell death, the cell is incubated 
under conditions that result in the pathogen or the toxin killing those cells that do 
not contain a polypeptide of the present invention, or contain a polypeptide of the 
present invention that does not protect the cell. The presence of a living cell 
indicates the polypeptide prevents cell death after exposure to a pathogen or a toxin. 
Without intending to be limited by theory, it is expected that a variable amino acid 
sequence of the polypeptides of the present invention will protect a cell from the 
pathogen or toxin by binding to target polypeptides or nucleotides of either viral or 
host cell origin. For instance, an amino acid sequence may bind to a pathogen 
polypeptide or nucleic acid sequence and prevent replication. A variable amino 
acid sequence may interact with host cell polypeptides or nucleic acids to protect 
the cell from pathogen challenge. For example, the variable amino acid sequence 
may selectively inhibit a cellular protease required for viral protein processing, or 
may down-regulate the expression and/or transport of host cell receptors required 
for pathogen or toxin entry. 

The pathogen or toxin is preferably cytotoxic to the cell. As used herein, 
"cytotoxic" and "cytopathic" are used interchangeably and refer to the ability of a 
pathogen or a toxin to cause cell death. Preferably, a pathogen or a toxin are 
cytolytic, i.e., cause the cell contacted with the pathogen or toxin to lyse. As used 
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herein, "cell death" refers to the permanent cessation of proliferation, and is 
typically apparent as the degeneration or necrosis of the cell. As used herein, 
"proliferation" includes, but is not limited to, replication. When the pathogen or 
toxin is cytotoxic but not cytolytic, cell viability can be determined by methods 
5 known to the art, including trypan blue exclusion, propidium iodide exclusion, and 
observing the presence of colonies growing on a plate containing other cells that 
have ceased to proliferate. When the pathogen or toxin is cytolytic, cell viability 
can be evaluated by measuring the amount of cell lysis, and by observing the 
presence of colonies growing on a plate after exposure to a pathogen or a toxin. 
10 Examples of pathogens include viruses and microbes that are cytotoxic to 

cells. Without intending to be limiting, examples of viruses include members of the 
genera within the family Bunyaviridae, such as Bunyavirus, Nairovirus, 
Phlebovirus, and Hantavirus. Examples of Bunyavirus include LaCrosse virus, 
Bunyamwera virus, and Oropuche virus. Examples of Nairovirus include Crimean 

1 5 Congo haemorrhagic fever virus and Dugbe virus. Examples of Phlebovirus 
include Rift Valley fever virus, Punta Toro virus, Sandfly Sicilian virus, and 
Unkuniemi virus. Examples of Hantavirus include Hantaan virus, Sin Nombre 
virus, and Seoul virus. 

Other non-limiting examples of viruses include members of the flavivirus 

20 family, including the genus Hepacivirus (for instance, Hepatitis C virus) and the 
genus Flavivirus (Yellow fever virus, Tick-borne encephalitis complex viruses) , 
members of the family Togaviridae, genus Alphavirus (eastern equine encephalitis 
virus, Venezuelan Equine Encephalitis virus, western equine encephalitis virus), 
members of the family Filovirus (Ebola and Marburg viruses), Equine Morbillivirus 

25 (Hendra virus), members of the family Arenavirus (Lassa fever, Junin, Machupo, 
Sabia, Flexal, and Guanarito viruses), Variola major virus (Smallpox virus), human 
immunodeficiency virus, foot and mouth disease virus, and influenza virus. 

Examples of microbes that can be used in the methods described herein 
include, for instance, bacteria and fungi. Non-limiting examples of bacteria include 

30 rickettsias, including members of the genera Rickettsias (e.g., Rickettsia prowazekii, 
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and Rickettsia rickettsii) and Coxiella (e.g., Coxiella burnetii). Other examples of 
bacteria include members of the Enterobacteriaceae (including Escherichia coli, 
Shigella spp., Salmonella spp., and Yersinia spp.), Brucella spp. (for instance, 
Brucella abortus, Brucella melitensis, and Brucella suis), Bacillus anthracis, 
5 Burkholderia spp. (including Burkholderia mallei, and Burkholderia pseudomallei, 
Clostridium botulinum, Francisella tularensis. Non-limiting examples of fungi 
include, for instance, Coccidioides immitis. 

In many cases, strains of pathogens that are used as vaccines can also be 
used. Examples include the Bacillus anthracis Sterne strain (Brassier et al., Infect. 

10 Immun., 68, 1781-1786 (2000)), the MP-12 live-attenuated vaccine strain of Rift 
Valley fever virus (Caplen et al., /. Gen. Virol., 66, 2271-2277 (1985)), Yellow 
fever virus vaccine strain 17D, the TC-83 attenuated vaccine strain of Venezuelan 
equine encephalitis virus. 

Examples of toxins include biological toxins and chemical toxins. As used 

15 herein, a "biological toxin" is a toxin that is produced by a cell. Non-limiting 
examples of biological toxins include, for instance, Abrin, Aflatoxins, Anthrax 
toxin, Botulinum toxins, Clostridium perfringens epsilon toxin, Conotoxins, 
Diacetoxyscirpenol, Ricin, Saxitoxin, enterotoxins, Shigatoxin, Staphylococcal 
enterotoxins, Tetrodotoxin, and T-2 toxin. Examples of chemical toxins include, for 
20 instance, sarin. 

The cell that is exposed to a pathogen or toxin can be ex vivo or in vivo, 
preferably, ex vivo. As used herein, "in vivo" refers to a cell that is present within 
the body of an animal. As used herein, "ex vivo" refers to a cell that has been 
removed from the body of an animal. Ex vivo cells include, for instance, primary 

25 cells (e.g., cells that have recently been removed from a subject and are capable of 
limited growth in tissue culture medium), and cultured cells (e.g., cells that are 
capable of long term culture in tissue culture medium). The cell is a eukaryotic cell, 
preferably, a vertebrate cell, most preferably, a mammalian cell. Examples of 
mammalian cells include human, as well as other animals (for instance, mice, rats, 

30 or hamsters) that can be used as animal models in the study of the protective ability 
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of polypeptides of the present invention. 

Cells that are useful in the methods described herein vary depending on the 
pathogen or toxin used. The cell chosen for use with a particular pathogen or a 
particular toxin is one for which the pathogen or toxin is cytotoxic, preferably, 

5 cytolytic. Which cells are appropriate for use with a particular pathogen or a 
particular toxin is known to the art, and can be chosen by a person having skill in 
the art. Examples of some pathogen/cell combinations include Bacillus anthracis 
and the macrophage cell line J774A.1 (Friedlander et al., Infect Immum, 61, 245- 
252 (1993), and Hanna et al., Mol Biol, Cell 3, 1269-1277 (1992)), Rift Valley 

0 fever virus and Vero cells, Venezuelan equine encephalitis and Vero cells, and 
Rickettsia spp. and primary chick embryo cells (Walker and Cain, Lab Invest, 43, 
388-396 (1980)) and human endothelial cell culture (Walker et al, Fed. Proa, 40, 
72A(1981)). 

In one aspect of the invention, the polypeptide of the present invention can 
5 be expressed in a cell that is to be exposed to a pathogen or a toxin. Typically, the 
polypeptide will be present in a cell in a vector as described herein, preferably 
inserted into the chromosome. In another aspect, the polypeptide of the present 
invention can be introduced to the cell. Typically, such a polypeptide will contain a 
cell-permeant region that will allow the polypeptide to traverse the cell membrane. 
In this aspect, the polypeptide can be introduced to the cell before, at the same time, 
or after exposing the cell to the pathogen or toxin. 

After protective polypeptides are identified by these methods, the 
polynucleotides encoding the protective polypeptides can be cloned and expressed. 
The expressed protective polypeptides may be isolated using chromatographic 
methods known to the art, preferably using cation exchange and size exclusion 
chromatography. Isolated protective polypeptides can then be introduced to cells to 
verify that the polypeptides are protective. Optionally, the ability of the isolated 
protective polypeptides can be introduced to animals to evaluate whether the 
protective polypeptides will protect against challenge with a pathogen. 
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The present invention is illustrated by the following examples. It is to be 
understood that the particular examples, materials, amounts, and procedures are to 
be interpreted broadly in accordance with the scope and spirit of the invention as set 
forth herein. 

5 

EXAMPLES 

Example 1 
Crystalization of the CCD protein 

10 

Cloning, expression and purification of CCD. cDNAs encoding the capsid 
C-terminal domain (CCD; capsid residues 1 18-275) of the VEE virus were 
amplified from purified pUC18 plasmid containing the Trinidad donkey strain of 
VEE virus (Kinney et al., Virology, 110, 19-30 (1989)) using PCR and the primers 

15 5' GGGAATTCCATATGGTCATGAATTGGAATCTGACAAG (SEQ ID NO:9) 
and 5 -GAATTCGGATCCTCATTACCATTGCTCGCAGTTCTCCGGAGT (SEQ 
ID NO: 10). The PCR products were each digested with Ndel and BamHI restriction 
enzymes (Promega, Madison, WI) and subcloned into pET30 expression vectors 
(Novagen, Madison, WI). 

20 The tat-CCD construct was produced by PCR using CCD in the pET30 

vector and the primers N-TATCCD 

(5'ATGTACGGTCGTAAAAAACGTCGTCAGCGTCGTCGTGTCATGAAATTG 
G AATCTG AC A3 ' ) and CBAM-VEE 

(5 5 GAATTCGGATCCTC ATT ACC ATTGCTCGC AGTTCTCCGGAGT3 ' ) . The 
25 PCR product was phenol-chloroform extracted and was ligated into the pETBLUE 
vector. It was then transformed into NovaBlue Singles (Novagen) and plated on 
LB-Bluogal-IPTG-carbenicillin-tetracycline plates. White colonies were selected 
for amplification, plasmid purification, and sequencing. The tat-CCD cDNA 
sequence was determined and is depicted in Figure 3. 
30 The VEE virus CCD was expressed from BL21(DE3) cells grown at 37°C, 
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induced with 1 mM isopropyl-thio-p-D-galactoside (IPTG) for 2 to 4 hours and 
recovered from the supernatant after mild sonication of bacterial pellets in ice-cold 
lysis buffer (20 mM TrisHCl pH 7.3, 5 mM DTT, 150 mM NaCl, 5% glycerol, 2 
mM EDTA). The CCD was purified using cation exchange HPLC (Poros 20 SP 
5 column, Perseptive Biosystems, Farmingham, MA) in chilled MOPS buffer (20 mM 
MOPS pH 7.3, 5 mM DTT) and size exclusion chromatography (Biosep SEC- 
S3000, Phenomenex, Torrence, CA) in chilled Tris buffer (10 mM Tris pH 7.3, 150 
mM NaCl, 10 mM DTT). Purified CCD was concentrated to 30 mg/ml using 
ultrafiltration (Millipore, Bedford, MA). 

10 Crystallization and data collection. Crystals of the CCD were grown in 

hanging drops at 18°C using 5 of protein solution (10 mg/ml) mixed with an 
equal volume of precipitant. Two different crystallization conditions produced 
crystals that diffract to high resolution. Condition one, producing crystal type I, was 
24% (w/v) polyethylene glycol (PEG) 1000, 0.1 M ammonium sulfate, 5 mM urea 

15 and 0. 1 M Bis-Tris buffer pH 6.5. Condition two, producing crystal form II, was 
20% (w/v) PEG 4000, 0.2 M MgCl 2 buffered in Bis-Tris at pH 6.5. PEG 4000 could 
not be replaced with PEG 1000 in condition two. CCD crystals in both conditions 
grew to approximately 0.2 mm x 0.2 mm x 0.5 mm size within 10 days. 

Crystals were transferred to precipitant solution containing 10% 2-methyl 

20 dioxypentane (MPD) as cryo-protectant. X-ray diffraction data were collected at 
105 K using MacScience Dip 2030 imaging plate detector on a rotating anode 
generator running at 50 kV and 90 mA. All the data were processed using the 
DENZO and SCALEPACK programs (Otwinowski & Minor, Methods Enzymol, 
276, 307-326 (1997)). Details of the data collection and quality of the data were 

25 given in Table 1. 
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TABLE L Data collection statistics for VEE virus CCD crystals. 



Crystal form I 



Crystal form D 




P2]2j2j 
69.3, 75.2, 94.7 
105 
180 



30-2.3 
168,290 
22,453 
99.0 
6.9 
2.38-2.3 
97.9 
30.9 



P2,2,2, 
72.0, 76.0,90.1 
105 
200 



30-2.45 
172,603 
18,598 

99.2 

7.0 
2.54-2.45 

99.8 

26.2 



Molecular replacement solution. The calculated Matthew coefficient 
implied, and subsequent molecular replacement solutions (Navaza, J. Acta 
Crystallog., A50, 157-163 (1994)) confirmed, that there are three monomers (A, B, 
and C) per asymmetric unit. The CCD structure in crystal type I was solved by 
molecular replacement with the AMoRe program (Navaza, J. Acta Crystallog,, A50, 
157-163 (1994)) contained in the CCP4 suite. The C-terminal domain from Sindbis 
virus capsid protein (PDB code 1WYK; Choi et aL, Nature, 354, 37-43 (1991)) was 
used as a search model. Two prominent peaks were obtained in the cross-rotation 
search calculated using reflection data from 10 A to 3 A and with an integration 
radius of 30 A. The largest peak in the cross-rotation function gave rise to the 
largest peak in the translation function using P2 1 2 1 2 1 as the space group. The 
position of this monomer was fixed and the translation of the second monomer 
determined. This translation was found with the second highest peak in the rotation 
function. With the first and second monomers fixed, the translation function for the 
third monomer was obtained. This translation vector corresponded to the 39th 
rotation function peak. After rigid body refinement performed in AMoRe, the 
crystallographic R-value was 44% with a correlation coefficient of 0.50 calculated 
with data between 8 A and 3 A resolution. 
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Structure refinement. Refinement of the CCD structure in crystal type I was 
performed using the Crystallography and NMR System package (CNS version 0.5; 
Brunger et al., Acta Crystallogr, D54, 905-921 (1998)), with maximum likelihood 
5 target function. Bulk solvent correction and overall anisotropic temperature factors 
were applied during refinement. The initial model was subjected to torsion angle 
dynamics simulated annealing with a starting temperature of 4,000K and a cooling 
step interval of 25K. Following simulated annealing, repeated rounds of Powell 
positional refinement followed by individual B-factor refinement and model 

10 building with the graphic package O (Jones et aL, Acta Crystallog. , A47, 110-119 
(1991) were performed. The VEE capsid sequence was incorporated into the model 
using sigma-A-weighted Fo-Fc electron density maps (Read, Acta Crystallog., A42, 
140-149(1986)). The first six amino-terminal residues in monomers A and B were 
also built into sigma-A-weighted Fo-Fc electron density maps. Included in the 

15 crystal type I structure are seven urea molecules and two sulfate ions. The 
possibility that these are water molecules was eliminated by examination of 
difference Fourier maps. These difference maps, calculated with water in place of 
urea or sulfate, had significant positive density on and around the replaced water 
molecules. Details of the refined model are given in Table 2. 

20 Refinement of the CCD structure in crystal type II was initiated with CNS 

rigid body refinement. The starting model was the refined crystal type I structure. 
Following rigid body minimization, the type II model was refined as described 
above. No non-crystallographic symmetry restraints were applied during structure 
refinement. For both structures, 5% of the diffraction data were excluded from the 

25 refinement calculations and used to calculate the free R- value (Brunger, Nature, 
355, 472-474(1992)). The final R-factors for both structures are given in Table 2. 
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TABLE 2. Refinement statistics for VEE virus CCD structures. 



10 



15 



20 



Resolution range (A) 

Rays. (Rfree ) (%) 

No. of reflections (test set) 
Residues 



No. of water molecules 
No. of urea molecules 
No. of sulfate molecules 
Average B-factors (A ) 
CCD monomers 
solvent 
urea 
sulfate 
Bond length rmsd 1 (A) 
Bond angle rmsd(°) 



Crystal form I 



Crystal form II 



30-2.3 
22.8 (26.3) 
20,772 (1069) 
Molecule A: 120-275 
Molecule B: 1 19-275 
Molecule C: 124-275 
176 
7 
2 

A: 26; B: 29; C: 44 
36 
50 
76 
0.007 
1.35 



30-2.45 
21.6 (25.6) 
17,464 (896) 
Molecule A: 119-275 
Molecule B: 119-275 
Molecule C: 123-275 
223 



A:27; B: 28; C:37 
34 



0.007 
1.38 



rmsd, root mean square deviation. 

Example 2 

Cloning combinatorial adaptein libraries into packaging vectors. 



This describes the insertion of a DNA oligonucleotide, which contains a 
25 stretch of random sequence, into the DNA sequence coding for the tat-CCD protein, 
within a retrovirus packaging vector. This allows for the expression of a fusion 
protein that contains tat, CCD, and a random peptide inserted into the CCD 
sequence. A number of approaches are being used to create these adaptein 
combinatorial libraries within packaging vectors. 

30 

1. Combinatorial synthetic oligonucleotide method for cloning combinatorial 
adaptein libraries into packaging vectors. 

The tat-CCD DNA sequence was inserted into a number of retroviral 
packaging vectors, including pLPCX and pLNCX2 from Clontech (Palo Alto, CA) 
35 and pFB from Stratagene (La Jolla, CA). To insert the tat-CCD sequence into 
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pLNCX2, the tat-CCD sequence was amplified by PCR from tat-CCD in the 
pETBlue plasmid (Novagen, Madison, WI). The nucleotide sequence encoding 
CCD was amplified and ligated into pETBlue as follows. The 5' fragment of the 
tat-CCD was amplified using the primers CCDnEcoR(+) 
5 (5'AGCTAGGAATTCGGATCCCATATGTACGGTCGTAAAAAACGTC (SEQ 
TD NO: 13) and CCDnHind(-) 

(5 ' CT AGCT A AGCTTGTTCC AC ATG ACG ACTG A A AG (SEQ ID NO: 14). The 
PCR product of the 5'fragment was digested with EcoRI (New England Biolabs, 
Beverly, MA) and Hindm (New England Biolabs) and was gel purified. The 

10 CCDnEcoR(+) primer added the nucleotides encoding the tat protein to the amino 
terminal end of the CCD. The pLNCX2 vector was digested with Hindm, partially 
digested with EcoRI, the largest fragment (4.6 kb) was gel purified, ligated to the 5' 
tat-CCD PCR fragment and transformed into DH5a cells for plasmid amplification 
and subsequent purification. The 3' fragment was amplified using the primers 

15 CCDcNot(-) (5 ' CT AGCTGCGGCCGCTC ATT ACC ATTGCTCGC AGTTC (SEQ 
ID NO: 15)) and CCDcHindXho(+) 

(5 ' AGCT AG A AGCTTGGATCTTCTCTCG AGGG AGTT ACCGTGAAGT AT AC 
(SEQ ID NO: 16)), which also inserted a Hindffl/Xhol cloning site into ccd 
(between amino acids 260-262 of the full length ccd protein). The above plasmid 

20 and the tat-CCD 3 'fragment PCR product were digested with Hindm and NotI 
(New England Biolabs), were gel purified, and ligated together, thereby creating 
pLNCX2:tat-CCD that contains a Hindin/Xhol cloning site. To insert the tat-CCD 
into pFb and pLPCX, the pFb or pLPCX and pLNCX2: tat-CCD were digested with 
EcoRI and NotI followed by gel purification and ligation of tat-CCD into pFb or 

25 pLPCX. In addition, the Hindm site in these tat-CCD expression vectors was 
converted to BAMHL This was accomplished by PCR of tat-CCD using the 
primers bamccdr 

(5'GATCCTCGAGAGAAGATCCGGATCCGTTCCACATGACGACTGAAAGG 
GCT (SEQ ID NO: 17)), which converted the Hindm site to BamHI and ccd5rl 
30 (5 ' G ATCG A ATTCC ACC AGC AG A ATCG AC ATATGT ACGGTCGT A A A A A AC 
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GTCG (SEQ ID NO: 18)), which inserted a murine leukemia virus ribosomal 
binding site just 5' to the tat-CCD start site. This PCR product and pFb:tat-CCD 
were digested with EcoRI and Xhol (New England Biolabs) and were ligated 
together to from pFb:tat-CCD:Bam. In addition, the tat-CCD was excised from 
5 PFb:tat-CCD:Bam using Sail (Boehringer Mannheim, Indianapolis, IN) and Not 1 
and was inserted into pLPCX:tat-CCD that had been digested with Xho I (New 
England Biolabs) and Not I, thereby forming pLPCX:tat-CCD:BAM. 

The random library sequence is inserted into the multiple cloning site in the 
ccd sequence of the above constructs as follows. For each library, three oligos are 

10 5' phosphorylated and PAGE purified (Biosource International, Camarillo, CA). 
These included LIB (5 ' AGCTTTCCGGTGGT(NNK)mGGTGGTTCCC (SEQ ID 
NO: 19)), Link P3 (5'ACCACCGGAA (SEQ ID NO:20)), Link P4 
(5 ' TCG AGGGAACC ACC (SEQ ID NO:21)), and Link P5 
(5 ' AGCTGGGAACC ACC (SEQ ID NO:22)) ? where m = the number of amino 

15 acids to be in the random library, N= A,T,C,G, and K= T,G. To create a construct 
to insert into the Hindffl/Xhol site of the tat-CCD expression vectors, LIB, Link P3 
and Link P4 are annealed together. The binding of Link P3 to the 5' end of LIB 
created a Hindm cohesive end, while the binding of Link P4 to the 3' end of LIB 
created an Xhol cohesive end. Another library construct for insertion into the 

20 Hindm site of the expression vector alone is produced by annealing LIB to Link P3 
and Link P5. Lastly, since the tat-CCD expression plasmid was also engineered to 
contain a BamHI site in place of the Hindm, a construct is generated to insert into 
the BamHI and Xhol site. The annealed oligos for this construct were LIB-BAM 
(5 ' G ATCCTCCGGTGGT(NNK)mGGTGGTTCCC (SEQ ID NO:23)), Link P6 

25 (5 ' ACC A ACCGG AG (SEQ ID NO:24)), and Link P4 where Link P6 bound to the 
5' end of LIB-BAM and formed a BamHI cohesive end and Link P4 bound to the 3' 
end of LIB-B AM and formed an Xhol cohesive end. 

Additional approaches to constructing the insert containing the library are 
also being used. One of these approaches involves annealing a negative strand of 

30 LIB (termed LIB r/c) to LIB itself. The LIB r/c sequence was 
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(5TCGAGGGAACCACC(MNN)mACCACCGGAG (SEQ ID NO:25)), where M= 
C, A. When LIB and LIB r/c were annealed, cohesive ends for BamHI and Xhol 
are formed. Another approach is to use Sequenase V 2.0 (USB, Cleveland, Ohio) to 
synthesize the negative LIB strand. The oligos for this are LIBSEQBAM 
(5 ' GC ACGG ATCCTCCGGTGGT(NNK)oGGTGGTTCCCTCG AG ATCG (SEQ 
ID NO:26)) and SEQBAM Rev (5 ' CG ATCTCGAGGGAACC ATC (SEQ ID 
NO:27)). This sequenase product is then digested with BamHI (Promega, Madison, 
WI), and Xhol for insertion into the tat-CCD:BAM expression vectors. 

The above constructs are inserted into the tat-CCD expression vectors as 
follows. The tat-CCD or tat-CCD:BAM expression vectors are digested overnight 
at 37 °C with the appropriate enzymes (HindDI, BamHI, and/or Xhol) according to 
the manufacturers' protocol. The restriction digest products are electrophoresed on 
a 1% agarose gel containing 0.1 mg/ml ethidium bromide. DNA is visualized by 
ultraviolet light and the appropriate band is excised and gel purified using a 
QIAquick gel extraction kit (Qiagen, Valencia, CA). The purified tat-CCD 
expression vector is then incubated overnight at 16°C with the appropriate annealed 
constructs in the presence of 10X DNA ligase buffer (New England Biolabs), 400 U 
of T4 DNA Ligase (New England Biolabs), and dH 2 0 to a final volume of 20 \xl In 
addition, the ligated constructs that contained single stranded DNA are subjected to 
sequenase treatment (0.2 XJ/\xl sequenase, 0.2 mM dNTPs (Sigma, Saint Louis, 
MO), and 5X sequenase buffer (USB), for 1 hour at 37°C). The ligated library 
construct and tat-CCD expression vector are transformed into ElectroTen-Blue 
electroporation competent cells (Stratagene) or XLIO-Gold ultracompetent cells 
(Stratagene) according to the manufacturer's protocol. A small aliquot of cells are 
plated on LB-carbenicillin agar to determine transformation efficiency. The 
bacteria containing the library are then expanded and the DNA isolated by standard 
methods. 

2. PCR-based method for cloning a combinatorial adaptein libraries into packaging 
vectors. 
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Another strategy for constructing the adaptein library was to use the 
polymerase chain reaction (PCR) in conjunction with a degenerate oligonucleotide 
primer to introduce the random sequences into the retroviral packaging vector. A 
partially degenerate primer of sequence 5'- 
5 CCAGGCAAGCTTTCTGGANNKNNKNNKNNKNNKNNKGGATCTCTCGAG 
GGAGTTACC (SEQ ID NO:28) was used in combination with a normal 
downstream primer of sequence 5 ' -TGGTTCTCTAG A A ACTGCTG A (SEQ ID 
NO:29) to amplify the C-terminus of the CCD and the downstream vector 
sequences. The 5' and 3' ends of the degenerate primer are designed to anneal to 

10 the CCD sequence while the degenerate portion (NNKNNKNNKNNKNNKNNK 
(SEQ ID NO:30)) inserts the adaptein library into the amplicon. For the PCR 
reaction, plasmid pLPCX DNA (50 ng-20 pg), the 5' degenerate primer (100 p,g), 
the 3' primer (100 [ig\ a 1/10 volume of lOx Taq buffer, dNTP (20 mM), Mg 2+ (25 
mM), and Taq enzyme (5U), are mixed with H 2 0 to adjust the final volume to 

15 100|al. The PCR conditions are: 95°C denaturation for 2 minutes while adding the 
Taq enzyme, then 30 cycles as follows: 95°C denaturation for 30 seconds, 59°C 
primer annealing for 30 seconds, extension at 72°C for 30 seconds, with a final 
extension at 72°C for 10 minutes. The product of the PCR is analyzed by agarose 
gel electrophoresis, and the correct band is excised and purified using a standard 

20 method (Qiagen kit). The amplicon is analyzed using automated sequencing 
(Applied Biosystems 377 sequencer and BigDye sequencing kit) and an internal 
primer to ensure that a mixture of 2 or 4 nucleotides is present at each degenerate 
site introduced by the degenerate primer. Then the amplicon DNA and plasmid 
DNA (purified using the Qiagen miniprep kit) are digested overnight at 37 °C with 

25 Hind III and Xba I in buffer 2 from Gibco, lU/ug of DNA. The amplicon and 
plasmid DNA are analyzed by agarose gel elecrophoresis and the correct, linear 
fragments are excised and purified as described above. The amplicon and plasmid 
DNA are then ligated at a ratio of 1:3 vectoninsert, overnight at 14°C using T4 
ligase (Gibco-BRL). The ligation reaction is then cleaned on a spin column 
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(Pharmacia) before being transformed by electroporation in XB1-1 Blue competent 
E. coli (Stratagene). 

Production of Murine Leukemia virus (MLV) library stocks used to deliver adaptein 
5 libraries to target cells 

Recombinant replication deficient Murine Leukemia viruses (MLV) were 
used to deliver and express the adaptein library in cells. This virus permits 
introduction and expression of the adaptein library in a broad range of cell types. 
The tat-CCD gene was inserted into the packaging vector pLPCX between the BglTL 

10 and Notl endonuclease restriction sites, or into the packaging vector pLNCX2 
between the EcoRl and Notl endonuclease restriction sites. The vector harbors a 
cDNA copy of an MLV provirus but lacks the structural genes gag, pol and env. 
What remains is the psi sequence required for efficient packaging of viral RNA and 
the flanking long terminal repeats (LTR) that function in chromosomal integration 

15 and transcription. This vector is bicistronic; the 5' LTR drives expression of a gene 
encoding resistance to puromycin and the human cytomegalovirus (CMV) 
immediate early promoter (P cmm ) drives adaptein library gene expression. 

The adaptein library-containing vector DNA is isolated from E. coli bacteria 
by standard techniques and purified by equilibrium density gradient separation 

20 using cesium chloride to form the gradient and following standard techniques. The 
DNA (adaptein library) is dissolved at 1 mg/ml in ImM EDTA, 10 mM Tris-HCl, 
pH 8.0 (TE) and stored at -80°C until required. To produce the MLV encoding the 
adaptein library, HEK 293 human fibroblasts are grown to 80% confluence on 
plastic plates in Dulbecco's Modified Eagle Medium (DMEM) supplemented with 

25 10% fetal calf serum (culture medium) and with the bacterial antibiotics 

streptomycin and penicillin. The cells are simultaneously transfected with vector 
DNA encoding the adaptein library (25 jig), the MLV structural genes gag-pol (25 
|ig) and the Vesicular Stomatitis virus glycoprotein (VSV-G; 5 |ig). Transfection is 
conducted according to the methods of Chen and Okayama (MoL Cell Biol, 7, 

30 2745-5272 ( 1 987) and used calcium phosphate to form a precipitate with the DNA 
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that is efficiently taken up by this cell type and the encoded genes expressed. The 
DNA is mixed with 0. 1 mM EDTA, 1 mM Tris-HCl, pH 8.0 in purified water to 
give a total volume of 1.25 ml. To this 1.25 ml of 50 mM BES, 280 mM NaCl, 1.5 
mM Na 2 HP0 4 , pH 7.0 is added and mixed. Calcium chloride (2 M, 0.16 ml) is 
5 added dropwise while mixing and then incubated for 10 minutes at room 

temperature. This is then applied to cells with 2.5 ml of the mixture being used per 
15 cm diameter plastic dish containing 25 ml of culture medium. The following day 
the culture medium is removed and cells washed with 7 ml of fresh medium and 
replaced with 20 ml of culture medium. To generate sufficient complexity in the 

10 library this procedure is repeated 50 times to yield 1 x 10 8 transfected cells 
expressing each library member. 

To isolate virus particles containing the adaptein library, the culture 
supernatants are pooled on the second day after transfection and filtered through a 
0.45 |im cellulose acetate filter to remove cells and debris. The MLV-adaptein 

15 library-containing medium is then stored at -80°C in 5 ml aliquots or used 

immediately. The number of virus particles per ml is enumerated by incubation of 
HEK 293 cells using serial 10-fold dilutions of the filtered medium and after two 
days exposing the cells to 1 jig/ml of puromycin. Colonies formed after 5 days gave 
the initial number of infectious particles. When done using controls (viruses 

20 containing the CCD alone), approximately 80% of cells infected at MOI 

(multiplicity of infection ) of 1.0, and the titers were determined to be greater than 
> 10 8 plaque forming units per ml. 

C Expression of adaptein libraries in target cells 

25 The MLV encoded adaptein library is introduced into target cells by seeding 

cells at 20% confluence on plastic dishes in their required medium and after 
overnight growth, reaching a confluence of 40% the virus containing culture 
medium is applied to give a ratio of virus to cells of 1.5:1. This ensures that most 
target cells received one virus. Two days later the cells are assayed for adaptein 

30 library expression or challenged with the pathogenic agent. 



30 



Example 3 

Challenge of adaptein library-containing cells with the pathogen Rift Valley fever 

virus (RVFV). 



5 RVFV is highly cytopathic in virtually all types of cell cultures that it 

infects. Initially the live-attenuated vaccine strain MP- 12 (Peters and Slone, /. Med. 
Virol. , 10, 45-54 (1982)) will be used. This strain, which is classified as Biosafety 
Level 2 (BSL-2) and has been well characterized at both the molecular and 
biological levels, will be used as a model system for pathogenic wild-type strains of 

10 RVFV, which is classified at BSL-4. Peripheral inoculation of mice leads to a 
viremia followed by death from hepataic necrosis (Peters and Anderson, In: 
Contributions to Epidemiology and Statistics, vol. 3 (Goldblum et aL, eds.), S. 
Karger, Basel, pp. 21-41 (1981); Peters et aL, Antiviral Research, 6, 285-297 
(1986)). This animal model is a commonly accepted model for human disease, and 

15 has been used for many studies with RVFV. Partial protection leads to encephalitis. 
Inbred rats have been used as models of encephalitis caused by RVFV (Peters and 
Slone, /. Med. Virol, 10, 45-54 (1982)). The assays described herein to identify 
adapteins that protect cells and animals from challenge with RVFV can be modified 
for use with different pathogens and different cells using methods known to the art. 

20 Vero cells, which are uniformly susceptible to cytolytic infection with 

RVFV, are in challenge experiments. Vero cells expressing an adaptein library are 
challenged at a multiplicity of infection (MOI) of 0.1. After incubation at 37°C for 
48 hours (or 35°C for MP-12), 100% of cells normally show cytopathic effects 
(CPE). However, cells expressing a protective adaptein survive in the presence of 

25 viral replication in surrounding cells and repeated challenge from virus in the cell 
culture supernatant. 

Cells expressing an adaptein that protects against RVFV challenge survives 
and continues replicating, producing small cell colonies on the plastic surface after 
CPE is complete in surrounding, unprotected cells. PCR- or cloning-based protocols 

30 are used to identify the protective adaptein sequence from each of these colonies. If 
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evidence of a mixed adaptein population is found, (for instance, two or more 
sequences), several DNA clones of a PCR amplicon will be sequenced to assess the 
sequence diversity in the population. Protective adapteins are recloned into 
retroviral vectors, and cells infected with these single-adaptein retroviruses are 
challenged with RVFV to ensure that most to all infected cells are protected. 
Protection is assessed by continued cell proliferation, reduction or lack of infectious 
virus in the culture supernatant, and lack of detectable viral RNA in cells compared 
to control virus-infected cultures. 

After confirming the efficacy of retrovirus containing a single protective 
adaptein, other cell types (e.g., hepatocyte, neuronal, primary human macrophages 
and monocytic cell lines) are tested to determine if protection is cell-type specific. 
Cell protection after challenge is assayed by decreased viral replication compared to 
negative controls, reduction or lack of infectious virus in the culture supernatant, 
and delayed or reduced CPE. 

The initial RVFV selection procedures in cell culture may overwhelm cells 
with the highly virulent RVFV. If no resistant cells are detected, the stringency of 
the challenge can be modified to increase the likelihood of cell survival, including: 
reducing the MOI of the initial RVFV challenge to delay viral replication; using 
MP-12 at partially non-permissive temperatures, e.g., 37-39°C; periodically 
replacing the cell culture medium to prevent continuous virus challenge of the 
surviving cells; or adding neutralizing antibody to the supernatant to reduce 
continuous challenge of the surviving cells. Alternatively, other less stringent 
Bunyaviridae viruses, such as Punta Toro or LaCrosse, can be used for initial 
implementation. 

Example 4 

Evaluate efficacy of recombinant anti-RVFV adapteins in cell culture and animal 

models. 

Protective adapteins are subcloned into bacterial expression vectors, 
expressed, and purified. Minimum adaptein toxicity levels will be established using 
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standard protocols. Adaptein entry into cultured cells is confirmed by 
immunofluorescence microscopy of Tat-CCD as described above. Cytoplasmic 
immunofluorescence indicates that the agent has reached the desired compartment. 
To assess the distribution of each protective adaptein in vivo, 0.1, 1, 10 and 100 
5 nanomoles (nmol) of it is inoculated intraperitoneally into mice, and at 2, 4, 12, 24 
and 38 hours, the brain, liver, heart muscle, lung, spleen and peritoneal lymph nodes 
are dissected. Confocal fluorescence microscopy of deparaffnized sections are used 
to assess the targeting of the adapteins to appropriate sites, especially the 
endothelial and hepatic tissues (believed to determine the course of RVF 

10 pathogenesis) and the brain (where infection is associated with encephalitis). 

Next, the ability of cell-permeant protective adapteins to shield cultured 
cells and animals against RVFV challenge is tested. Cells will be pretreated with a 
range of subtoxic adaptein concentrations before challenge with RVFV. CPE is 
monitored and compared with controls; viral replication is assessed by determining 

15 the infectious titers in the supernatant after various incubation times (12, 24, 36, 48 
hours). Adapteins showing efficacy in cell culture are tested in NIH Swiss mice 
treated with cell-permeant adapteins at subtoxic intraperitoneal and intravenous 
doses before virus challenge; aerosol challenge is performed on compounds 
showing initial efficacy. Based on previous studies with Tat-p-Gal (Schwarze et aL, 

20 Science, 285, 1569-1572 (1999), it is expected that doses in the range of 1-500 |ig 
will be administered initially, with lethality assessed using 5 animals per group. 
Adapteins with antiviral activity are then retested in additional mice and full 
pathological studies of these animals performed. These include complete necropsies 
with conventional histological techniques to assess tissue pathology, determination 

25 of viral load in various organs such as lymph nodes and brain by plaque assays, and 
immunohistology for localization of viral antigen in various organs. Viremia is 
monitored and average survival times of virus-infected animals is compared. Post- 
exposure efficacy and combinations of partially protective adapteins are also 
evaluated. 
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Example 5 

Evaluate shared protective action of anti-RVFV adapteins by challenging cell and 
animal models with diverse bunyaviruses. 
Adapteins that protect cells against challenge by RVFV are tested against 
5 members of all four genera of Bunyaviridae: Bunyavirus (LaCrosse, Bunyamwera), 
Nairovirus (Crimean Congo HF), Phlebovirus (RVF plus Punta Toro, Sandfly 
Sicilian), and Hantavirus (Hantaan, Sin Nombre). Testing of these agents is done in 
cell culture; animal tests, similar to those described above, are undertaken. 

10 Example 6 

Recombinant carrier protein crosses cell membranes and accumulates within the 

cytoplasm. 



We examined the ability of purified recombinant Tat-CCD carrier protein to 
15 cross cell membranes. Recombinant tat-CCD was isolated from BL21(DE3) cells 
grown at 37°C, induced with 1 mM isopropyl-thio-p-D-galactoside (IPTG) for 2-4 
hours and recovered from the supernatant after mild sonication of bacterial pellets 
in ice-cold lysis buffer (20 mM Tris-HCl pH 7.3, 5 mM DTT, 150 mM NaCl, 5% 
glycerol, 2 mM EDTA). The tat-CCD was purified using cation exchange HPLC 
20 (Poros 20 SP column, Perseptive Biosystems) in chilled MOPS buffer (20 mM 
MOPS pH 7.3, 5 mM DTT) and size exclusion chromatography (Biosep SEC- 
S3000, Phenomenex) in chilled Tris buffer (10 mM Tris pH 7.3, 150 mM NaCl, 10 
mM DTT). Purified tat-CCD was concentrated to 30 mg/ml using ultrafiltration 
(Amicon). 

25 Polyclonal antibody to CCD was made by injecting rabbits with CCD in 

Freund's adjuvant. 

Cultured Vero cells were incubated with 1 |iM Tat-CCD protein for 20 
minutes at 37°C, then extensively washed in PBS buffer to remove extracellular 
protein. Cells were fixed in glutaldehyde, permealized with Triton X-100, and 

30 incubated with anti-CCD polyclonal antibody, and then with FFTC-conjugated anti- 
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rabbit secondary antibody. Cells were examined by immunofluorescence 
microscopy. Tat-CCD clearly localized to the cytoplasm of Vero cells. No nuclear 
localization was observed for Tat-CCD. In addition, no immunofluorescence was 
observed in control cells incubated with similar concentrations of recombinant CCD 
5 protein. 



The complete disclosure of all patents, patent applications, and publications, 
and electronically available material (e.g., GenBank amino acid and nucleotide 
sequence submissions, and computer programs) cited herein are incorporated by 
10 reference. The foregoing detailed description and examples have been given for 
clarity of understanding only. No unnecessary limitations are to be understood 
therefrom. The invention is not limited to the exact details shown and described, for 
variations obvious to one skilled in the art will be included within the invention 
defined by the claims. 

15 All headings are for the convenience of the reader and should not be used to 

limit the meaning of the text that follows the heading, unless so specified. 
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SEQ ID NO:2 


Cell-permeant polypeptide 




SEQ ID NO:3 


Cell-permeant polypeptide 




SEQ ID NO:4 


Cell-permeant polypeptide 




SEQ ID NO:5 


Cell-permeant polypeptide 




SEQ ID NO:6 


Nucleotide sequence encoding tat-CCD 
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SEQ ID NO:7 


Amino acid sequence of tat-CCD 




SEQIDNO:9-10 


Primer 




SEQ ID NO: 11 


A variable region amino acid sequence 




SEQ ID NO: 13-30 


Primer 




SEQIDNO:31 


Adaptein- 1 nucleotide sequence 
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SEQ ID NO:32 


Adaptein-2 nucleotide sequence 
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SEQ ID NO:33 Adaptein-1 amino acid sequence 

SEQ ID NO: 34 Adaptein-2 amino acid sequence 
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